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SEQUENCE LISTING 

) GENERAL INFORMATION: 

(i) APPLICANT: Life Technologies, Inc. 

8 717 Grovemont Circle / 
Gaithersburg, MD 20884-9980 
APPLICANTS/INVENTORS: ChatterjeZ, Deb K. 

(ii) TITLE OF INVENTION : Cloned d4 Polymerases from totcga 

and Muj/ants Thereof 

(iii) NUMBER OF SEQUENCES: 22 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: STeW KESSLER, GOLDSTEIN & FOX P L L C 

B STREET: 1100 Ne/ York Ave., N.W., Suite 600 

(C) CITY: Washingtj&i 

(D) STATE: DC 

(E) COUNTRY: USA y 

(F) ZIP; 20005 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM Tv£>E: Floppy disk 

(B) COMPUTER: : IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.3 0 

(yi) _CURRENT__A^PPL I CATION -DATA: 

(A) APPLICATION NUMBER: To be assigned 

(B) FILING DATE: Herewith 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A/ APPLICATION NUMBER: 08/525,057 
(B) FILING DATE: 08-SEP-1995 
<tC) CLASSIFICATION: 

PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/537,397 

(B) FILING DATE: 02 -OCT- 1995 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/537,400 

(B) FILING DATE: 02 -OCT- 199 5 

(C) CLASSIFICATION: 
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(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/576,759 

(B) FILING DATE: 21-DEC-1995 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: To be assigned 

(B) FILING DATE: 14-AUG-1996 ' 

(C) CLASSIFICATION: 



(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Esmond, Robert W. 

(B) REGISTRATION NUMBER: 3^,893 

(C) REFERENCE/DOCKET NUMBER : 0942.280PC03 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-371-2600 

(B) TELEFAX: 202-371-2^40 



(2) INFORMATION FOR SEQ ID NO:> 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 bas4 pairs 

(B) TYPE: nucleic/ acid 

(C) S TRANDEDNE S S/i both 

(D) TOPOLOGY: b/th 

( i i )_ -MOLECULE -TYPE : /cDNA 



(xi) sequence Description : seq id no : i : 

GAGCTCACGG GGGATGfcAGG AAA 
(2) INFORMATION^ FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

km LENGTH: 2682 base pairs 
'S) TYPE: nucleic acid 
'(C) STRANDEDNESS : both 
(D) TOPOLOGY: both 

(ij/) MOLECULE TYPE: cDNA 



23 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
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ATGG CG AG AC TATTTCTCTT TGATGGCACA GCCCTGGCCT ACAGGGCATA TTACGCCCTC 60 

GACAGATCCC TTTCCACATC CACAGGAATT CCAACGAACG CCGTCTATGG GGTTGCCAGG 12 0 

ATGCTCGTTA AATTCATTAA GGAACACATT ATACCCGAAA AGGACTACGc/ GGCTGTGGCC 180 

TTCGACAAGA AGGCAGCGAC GTTCAGACAC AAACTGCTCG TAAGCGAGAA GGCGCAAAGG 240 

CCAAAGACTC CGGCTCTTCT AGTTCAGCAG CTACCTTACA TCAAGCGGCT GATAGAAGCT 300 

CTTGGTTTCA AAGTGCTGGA GCTGGAGGGA TACGAAGCAG ACGMATCAT CGCCACGCTT 360 

GCAGTCAGGG CTGCACGTTT TTTGATGAGA TTTTCATTAA TA^CCGGTGA CAAGGATATG 420 

CTTCAACTTG TAAACGAGAA GATAAAGGTC TGGAGAATCG TCAAGGGGAT ATCGGATCTT 480 

GAGCTTTACG ATTCGAAAAA GGTGAAAGAA AGATACGGTg/tGGAACCACA TCAGATACCG 540 

GATCTTCTAG CACTGACGGG AGACGACATA GAC^CAt/c CCGGTGTAAC GGGAATAGGT 600 

GAAAAGACCG CTGTACAGCT TCTCGGCAAG TATAGA^ATC TTGAATACAT TCTGGAGCAT 660 

GCCCGTGAAC TCCCCCAGAG AGTGAGAAAG GCTCTCTTGA GAGACAGGGA AGTTGCCATC 72 0- 

CTCAGTAAAA AACTTGCAAC TCTGGTGACG AAOBCACCTG TTGAAGTGGA CTGGGAAGAG 780 

ATGAAATACA GAGGATACGA CAAGAGAAAA CTACTTCCGA TATTGAAAGA AC TGGAGTTT 840 

GCTTCCATCA TGAAGGAACT TCAACTGTAC (3AAGAAGCAG AACCCACCGG ATACGAAATC 900 

GTGAAGGATC ATAAGACCTT CGAAGATCTCZ ATCGAAAAGC TGAAGGAGGT TCCATCTTTT 960 

GCCCTGGACC TTGAAACGTC CTCCTTGGAC CCGTTCAACT GTGAGATAGT CGGCATCTCC 1020 

GTGTCGTTCA AACCGAAAAC AGCTTATTAC ATTCCACTTC ATCACAGAAA CGCCCACAAT 108 0 

CTTGATGAAA CACTGGTGCT GTCG^AGTTG AAAGAGATCC TCGAAGACCC GTCTTCGAAG 1140 

ATTGTGGGTC AGAACCTGAA GTACGACTAC AAGGTTC TTA TGGTAAAGGG TATATCGCCA 12 00 

GTTTATCCGC ATTTTGACAC GATGATAGCT GCATATTTGC TGGAGCCAAA CGAGAAAAAA 1260 

TTCAATCTCG AAGATCTGTc/tTTGAAATTT CTCGGATACA AAATGACGTC TTATCAGGAA 132 0 

CTGATGTCGT TTTCCTCACC ACTTTTTGGT TTCAGCTTTG CGGATGTTCC GGTAGACAAG 13 80 

GCTGCCGAAT ACTCCT<3CGA GGATGCAGAC ATCACTTATA GGCTCTACAA GATACTCAGC 1440 

ATGAAGCTCC ATGi^fcCGGA ACTTGAGAAC GTCTT CT AC A GGATAGAGAT GCCGTTGGTG 1500 

AACGTCTTGG CACZGAATGGA ATTCAACTGG GTGTATGTTG ACACAGAATT CCTGAAAAAG 156 0 

CTCTCGGAGG AT3TACGGCAA AAAGCTCGAG GAACTGGCCG AAAAAATCTA CCAGATAGCA 162 0 
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GGTGAGCCCT TCAACATCAA TTCTCCAAAA CAGGTTTCAA ACATCCTTTT TG^GAAGCTG 
GGAATAAAAC CCCGTGGAAA AACGACAAAA ACAGGAGATT ACTCTACCAg/gaTAGAGGTG 
TTGGAAGAGA TAGCGAATGA GCACGAGATA GTACCCCTCA TTCTCGAG^T CAGAAAGATC 
CTGAAACTGA AATCGACCTA CATAGACACC CTTCCGAAAC TTGTG^ACCC GAAAACCGGA 
AGATTTCATG CATCTTTCCA CCAGACGGGT ACCGCCACTG GCAC&TTGAG TAGCAGTGAT 
CCAAATCTTC AGAATCTTCC GACAAAGAGC GAAGAGGGAA A^AAATTAG AAAAGCGATT 
GTGCCCCAGG ATCCAGACTG GTGGATCGTC AGTGCGGATT /tTCCCAAAT AGAACTCAGA 
ATCCTCGCTC ATCTCAGTGG TGATGAGAAC CTTGTGAAGG CCTTCGAGGA GGGCATCGAT 
GTGCACACCT TGACTGCCTC CAGGATCTAC AACGTAA^GC CAGAAGAAGT GAACGAAGAA 
ATGCGACGGG TTGGAAAGAT GGTGAACTTC TCTAtXaT ACGGTGTCAC ACCGTACGGT 
CTTTCTGTGA GACTTGGAAT ACCGGTTAAA GAAgAgAAA AGATGATTAT CAGCTATTTC 
ACACTGTATC CAAAGGTGCG AAGCTACATC CAg4gGTTG TTGCAGAGGC AAAAGAGAAG 
GGCTACGTCA GGACTCTCTT TGGAAGAAAA A^GATATTC CCCAGCTCAT GGCAAGGGAC 
AAGAACACCC AGTCCGAAGG CGAAAGAATC yfecGATAAACA CCCCCATTCA GGGAACTGCG 
GCAGATATAA TAAAATTGGC TATGATAGa/ ATAGACGAGG AGCTGAGAAA AAGAAACATG 
AAATCCAGAAT TGATC^TTCA ^GTTCAT^AC GAACTGGTOT TCG^^GTTCC^ CGATGAGGAa" 
AAAGAAGAAC TAGTTGATCT GCTGA^AAC AAAATGACAA ATGTGGTGAA ACTCTCTGTG 
CCTCTTGAGG TTGACATAAG catcCgaaaa AGCTGGTCTT ga 
(2) INFORMATION FOR SEQ Ii/nO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: &I3 amino acids 

(B) TYPE: am/no acid 

(C) STRANDE©NESS : single 

(D) TOPOLOC&: linear 

(ii) MOLECULE TOPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Met Ala Arg Leu Phe Leu Phe Asp Gly Thr Ala Leu Ala Tyr Arg Ala 
' 5 " 15 



1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340* 

2400 

2460 

2520 

2580 

2640 

2682 



Tyr Tyr Ala Leu Asp Arg Ser Leu Ser Thr Ser Thr Gly lie/Pro Thr 
20 25 3( 

Asn Ala Val Tyr Gly Val Ala Arg Met Leu Val Lys Phe/le Lys Glu 
35 40 4 ; 

His lie He Pro Glu Lys Asp Tyr Ala Ala Val Ala/he Asp Lys Lys 
50 55 60 / Y 

Ala Ala Thr Phe Arg His Lys Leu Leu Val Ser Zp Lys Ala Gin Arg 
65 70 75 / 80 9 

Pro Lys Thr Pro Ala Leu Leu Val Gin Gin X Pro Tyr He Lys Ara 
85 9 o / 9 \ 

Leu He Glu Ala Leu Gly Phe Lys Val Le/ Glu Leu Glu Gly Tyr Glu 
100 105 / 110 

Ala Asp Asp He He Ala Thr Leu Ala/val Arg Ala Ala Arg Phe Leu 
115 120 / 125 

Met Arg Phe Ser Leu He Thr Gly a/ P Lys Asp Met Leu Gin Leu Val 
130 135 / 140 

Asn Glu Lys He Lys Val Trp Arc/ He Val Lys Gly He Ser Asp Leu 

150 / 155 160 

Glu Leu Tyr Asp Ser Lys Lys Val Lys Glu Arg Tyr Gly Val Glu Pro 



170 



175 



His "Gin He Pro-Asp Leu Le^Ala Leu Thr Gly Asp Asp He Asp Asn 
180 , las 19Q 

He Pro Gly Val Thr Gly/lle Gly Glu Lys Thr Ala Val Gin Leu Leu 
195 / 200 205 

Gly Lys Tyr Arg Asn l/eu Glu Tyr He Leu Glu His Ala Arg Glu Leu 
210 / 215 220 

Pro Gin Arg Val Ar& Lys Ala Leu Leu Arg Asp Arg Glu Val Ala He 
225 / 230 235 240 

Leu Ser Lys Lys/ Leu Ala Thr Leu Val Thr Asn Ala Pro Val Glu Val 

/ 245 



250 



255 



Asp Trp Glu Glu Met Lys Tyr Arg Gly Tyr Asp Lys Arg Lys Leu Leu 

265 270 



/26O 



Pro lie Leu Lys Glu Leu Glu Phe Ala Ser lie Met Lys Glu Leu Gin 
2 / 75 280 285 



Leu Tyr/ciu Glu Ala Glu Pro Thr Gly Tyr Glu He Val Lys Asp His 
29gf 295 300 
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Lys Thr Phe Glu Asp Leu lie Glu Lys Leu Lys Gl/ Val Pro Ser Phe 
305 310 315 / 320 

Ala Leu Asp Leu Glu Thr Ser Ser Leu Asp Pro/phe Asn Cys Glu lie 
325 330 / 335 

Val Gly lie Ser Val Ser Phe Lys Pro Lys Tnr Ala Tyr Tyr lie Pro 
340 345 / 350 

Leu His His Arg Asn Ala His Asn Leu Asi/ Glu Thr Leu Val Leu Ser 
355 360 / 365 

Lys Leu Lys Glu He Leu Glu Asp Pro ^er Ser Lys He Val Gly Gin 
370 375 / 380 

Asn Leu Lys Tyr Asp Tyr Lys Val Le^ Met Val Lys Gly ile'Ser Pro 
385 390 / 395 400 

Val Tyr Pro His Phe Asp Thr Met ile Ala . Ala Tyr Leu Leu Glu Pro 
405 / 410 415 

Asn Glu Lys Lys Phe Asn Leu Glu 7 Asp Leu Ser Leu Lys Phe Leu Gly 
420 / 425 430 

Tyr Lys Met Thr Ser Tyr Gin <Au Leu Met Ser Phe Ser Ser Pro Leu 
435 &40 445 

Phe Gly Phe Ser Phe Ala Asp/ Val Pro Val Asp Lys Ala Ala Glu Tyr 
450 45? 460 



Ser Cys Glu Asp Ala Asp lie Thr Tyr Arg Leu Tyr Lys lie Leu Ser 
465 470 / 475 480 

Met Lys Leu His Glu Ala/siu Leu Glu Asn Val Phe Tyr Arg He Glu 
485 / 490 495 

Met Pro Leu Val Asn vdl Leu Ala Arg Met Glu Phe Asn Trp Val Tyr 
500 / 505 5io 

Val Asp Thr Glu Phe /Leu Lys Lys Leu Ser Glu Glu Tyr Gly Lys Lys 
515 / 520 525 



Leu Glu Glu Leu Al/a Glu Lys He Tyr Gin He Ala Gly Glu Pro Phe 
530 J 535 540 



Asn He Asn Ser iro Lys Gin Val Ser Asn He Leu Phe Glu Lys Leu 
545 / 550 555 



560 



Gly He Lys Pr6 Arg Gly Lys Thr Thr Lys Thr Gly Asp Tyr Ser Thr 
565 570 575 

Arg He Glu Val Leu Glu Glu He Ala Asn Glu His Glu He Val Pro 
/580 585 590 
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Leu lie Leu Glu Phe Arg Lys lie Leu Lys Leu Ir/k Ser Thr Tyr lie 
595 600 / 605 

Asp Thr Leu Pro Lys Leu Val Asn Pro Lys Thr yfely Arg Phe His Ala 
610 615 /620 

Ser Phe His Gin Thr Gly Thr Ala Thr Gly Ai/g Leu Ser Ser Ser Asp 
625 630 035 640 

Pro Asn Leu Gin Asn Leu Pro Thr Lys Ser/Glu Glu Gly Lys Glu lie 
645 650 655 

Arg Lys Ala lie Val Pro Gin Asp Pro A^p Trp Trp lie Val Ser Ala 
660 665 / 670 

Asp Tyr Ser Gin lie Glu Leu Arg lie 7 Leu Ala His Leu Ser "Gly Asp 
675 680 / 685 

Glu Asn Leu Val Lys Ala Phe Glu ^lu Gly lie Asp Val His Thr Leu 
690 695 / 700 

Thr Ala Ser Arg lie Tyr Asn Val Lys Pro Glu Glu Val Asn Glu Glu 
705 710 / 715 720 

Met Arg Arg Val Gly Lys Met X?al Asn Phe Ser lie lie Tyr Gly Val 
725 / 730 735 

Thr Pro Tyr Gly Leu Ser Va/l Arg Leu Gly lie Pro Val Lys Glu Ala 
740 / 745 750 

Glu Lys Met lie He Ser fcyr Phe Thr Leu Tyr Pro Lys Val Arg Ser 
755 / 760 765 

Tyr He Gin Gin Val VaA Ala Glu Ala Lys Glu Lys Gly Tyr Val Arg 
770 / 775 780 

Thr Leu Phe Gly Arg Lys Arg Asp He Pro Gin Leu Met Ala Arg Asp 
785 /790 795 800 

Lys Asn Thr Gin Serf Glu Gly Glu Arg He Ala He Asn Thr Pro He 
8Q5 810 815 

Gin Gly Thr Ala ilia. Asp He He Lys Leu Ala Met He Asp He Asp 
820 / 825 830 

Glu Glu Leu Ar^ Lys Arg Asn Met Lys Ser Arg Met He He Gin Val 
835 / 840 845 



His Asp Glu Lefu Val Phe Glu Val Pro Asp Glu Glu Lys Glu Glu Leu 
850 / 855 860 



Val Asp Leu Val Lys Asn Lys Met Thr Asn Val Val Lys Leu Ser Val 
865 / 870 875 880 
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- «« V,X £ 1U s „ Ile ^ ^ 

890 

(2) INFORMATION FOR SEQ id NO:4: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: sinqle 
TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ in/ NO:4: 

r° ser phe Aia ^ ^ U G1 7 Thr ser ser 

(2) INFORMATION FOR SEQ ID NO : 5: 

(i) SEQUENCE CHARACTERISTIC/ 

(A) LENGTH: ii amino afcds 

TYPE: amino acid / 
(C) STRANDEDNESS: single 
TOPOLOGY: linear/ 

(ii) MOLECULE TYPE : peptide 



(Xi) SEQUENCE DESCRIPj/xoN: SEQ 1 D NO :5 : 

I" 0 ^ 5 Ph / ASP ^ G1U Asp ser 

/ 10 
(2) INFORMATION FOR SE / ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS- 

A) LENGTH/ ii amino ac . ds 

(B) TYPE .-/amino acid 

(C) STRANDEDNESS: single 
TOPq£ogy: l inear 



(ii) MOLECULE 7 TYPE: peptide 



(Xi) SEQu/ N CE DESCRIPTION: SEQ ID NO:6: 
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Gly Pro Val Ala Phe Asp Ser Glu Thr Ser Al^ 
1 5 io 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: S5EQ ID NO : 7 : 
Met He Val Ser Asp He (Am Ala Asn Ala 



1 5 
(2) INFORMATION FOR SEQ ID J^O : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26/base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOG^ linear 

(ii) MOLECULE TYPE: cDNA 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 ; 

GACGTTTCAA GCGCTAGGGC AAAAGA 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(AO LENGTH: 10 amino acids 
(B) TYPE: amino acid 
JC) STRANDEDNESS: single 
f(D) TOPOLOGY: linear 

(ii) /MOLECULE TYPE: peptide 



/ 




(k±) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
Lys Met Val Asn Phe Ser He He Tyr Gly 



26 



1 5 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Lys Ala lie Asn Phe Gly L6u lie Tyr Gly 
1 5/10 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 atfcino acids 

(B) TYPE: amino /acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Lys Ala lie Thtr Phe Gly lie Leu Tyr Gly 
1 S 10 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) JTYPE: amino acid 

(C) /STRANDEDNESS: single 

(D) / TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) 
Lyi 



SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
Thr Phe lie Tyr Gly Phe Leu Tyr Gly 
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1 5 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 amino acic 

(B) TYPE: amino acid 

(C) STRANDEDNESS : sing] 

(D) TOPOLOGY: linear ' 

(ii) MOLECULE TYPE: peptide* 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13 : 

Lys Thr He Asn Phe Ly Val Leu Tyr Gly 
1 5 / 10 

(2) INFORMATION FOR SE(/lD NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH / 31 base pairs 

(B) TYPE : Aucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE/TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 14 : 
GTATATTATA GAfeTAGTTAA CCATCTTTCC A 
(2) INFORMATION FOR SEQ ID NO:15 : 

(i) SEQUENCE CHARACTERISTICS- 
'(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 




(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l 5: 
gtAggccagg GGCTGTGCCG GCAAAGAGAA ATAGTC 



-73- 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 5 base pairs 
,(B) TYPE: nucleic acid 
(C) STRANDEDNESS : single j 
<D) TOPOLOGY: linear / 

(ii) MOLECULE TYPE: cDNA / 

(xi) SEQUENCE DESCRIPTION: SEQ IDrNO:16: 
GAAGGATATC CTTGGCGCCG GTTATTATGA AAATC 
(2) INFORMATION FOR SEQ ID NO: 17: / 

(i) SEQUENCE CHARACTERISTICS^: 

(A) LENGTH: 1310 base Zairs 

(B) TYPE: nucleic acid^ 

(C) STRANDEDNESS : botth 

(D) TOPOLOGY: linear/ 

(xi) SEQUENCE DES CRI PT ION: SEQ ID NO : 1 7_: 

ATGGCGAGAC TATTTCTCTT TGAFGGCACA GCCCTGGCCT 

GACAGATCCC TTTCCACATC C^CAGGAATT CCAACGAACG 

ATGCTCGTTA AATTCATTAA (SGAACACATT ATACCCGAAA 

TTCGACAAGA AGGCAG CG Ay GTTCAGACAC AAACTGCTCG 

CCAAAGACGC CGGCTCTTOT AGTTCAGCAG CTAC CTTACA 

CTTGGTTTCA AAGTGCTJGGA GCTGGAAGGG TACGAAGCAG 

GCAGCAAAGG GCTGCJCCGTT TTTTGATGAG ATTTTCATAA 

CTTCAACTTG TAAACGAGAA GATAAAGGTC TGGAGAATCG 

GAGCTTTACG ATTCGAAAAA GGTGAAAGAA AGATACGGTG 

GATCTTCTAG CACTGACGGG AGACGACATA GACAACATTC 
GAAAAGACCG yCTGTACAGCT TCTCGGCAAG TATAGAAATC 

GCCCGTGAAG TCCCCCAGAG AGTGAGAAAG GCTCTCTTGA 



ACAGGGCATA TTACGCCCTC 60 

CCGTCTATGG CGTTGCCAGG 12 0 

AGGACTACGC GGCTGTGGCC 180 

TAAGCGACAA GGCGCAAAGG 24 0 

TCAAGCGGCT GATAGAAGCT 300 

ACGATATCAT CGCCACGCTT 360 

TAACCGGTGA CAAGGATATG 420 

TCAAGGGGAT ATCGGATCTT 480 

TGGAACCACA TCAGATACCG 540 

CCGGTGTAAC GGGAATAGGT 600 

TTGAATACAT TCTGGAGCAT 660 

GAGACAGGGA AGTTGCCATC 72 0 
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^ WflC0A / 



— *cr r- —r:r/— 

^ «zr ^ 

«- TC jr^ c — «cr 

U) SEQUENCE CHan 

(B) Typ b . 436 amino; 

■ linear/ 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
12<J0 
!3io 



s leu Phe y P G1 

35 ^ VI A / a w ^ _ 30 T hr 



33 °* ^ /a Arg Mefc 
so Pro «u Ly J 4S *ya Glu 



- * - - «. U, ;: u - ~ - "s - - 



80 



r / • Ln Gin t 8 

115 «- «, 7 »» Pro ^ «. to 

"° /" - «« v« ^ a t *« 
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Ala Asp Asp He He Ala Thr Leu Ala Ala Lys Gly Cys/fC Phe Phe 
115 120 12 g/ 

Asp Glu lie Phe He He Thr Gly Asp Lys Asp Met Zu Gin Leu Val 

135 14QT 

Asn Glu Lys He Lys Val Trp Arg He Val Ly S/ Gly He Ser Asp Leu 

160 

Glu Leu Tyr Asp Ser Lys Lys Val Lys Glu /rg Tyr Gly Val Glu Pro 



165 



170/ 



175 



His Gin lie Pro Asp Leu Leu Ala Leu jL Gly Asp Asp He Asp Asn 
180 - 185 / 190 



He Pro Gly Val Thr Gly He Gly G^x Lys Thr Ala 



195 



200 



Val Gin Leu Leu 
205 



Gly Lys Tyr Arg Asn Leu Glu Ty/ He Leu Glu His Ala Arg Glu Leu 

* ±u 215 ' 



220 



Pro Gin Arg Val Arg Lys Ala/eu Leu Arg Asp Arg Glu Val 



230 



235 



Ala He 
240 



Leu ser Lys Lys Leu Ala Thr Leu Val Thr Asn Ala Pro Val Glu Val 



245 



250 



Asp Trp Glu Glu Met Ly/ Tyr Arg Gly Tyr Asp Lys Arg Lys 

2 6 0 ~ — 



265 



255 



Leu Leu 



270 



Pro He Leu Lys Gluieu Glu Phe Ala Ser He Met Lys Glu Leu Gin 
275 / 2 80 285 

Leu Tyr Glu Glu A/a Glu Pro Thr Gly Tyr Glu He Val Lys Asp His 
/ 295 300 

Lys Thr Phe Glu/Asp Leu He Glu Lys Leu Lys Glu Val Pro Ser Phe 
/ 310 315 320 

Ala Leu Asp Leu Glu Thr Ser Ser Leu Asp Pro Phe Asn Cys Glu IK 

/ 3 2 5 - 



330 



335 



Val Gly X y ser Val Ser Phe Lys Pro Lys Thr Ala Tyr Tyr He Pro 



345 



Leu His As Arg Asn Ala His Asn Leu Asp Glu Thr 



e 



B55 



360 



350 

Leu Val Leu Ser 
365 



Lys WLys Glu He Leu Glu Asp Pro Ser Ser Lys He Val Gly Gin 

I 375 380 

Asn Leu Lys Tyr Asp Tyr Lys Val Leu Met Val Lys Gly He Ser Pro 



390 



395 



400 
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val Tyr Pro His Phe Asp Thr „ et He Ala Ala Tyr LeuLeu Glu Pro 

405 410 / 



Asn Glu Lys Lys Phe Asn Leu Glu Asp Leu Ser 
420 



425 



415 

Leu Lys Phe Leu Gly 
430 



Tyr Lys Met Thr 
435 



(2) INFORMATION FOR SEQ ID NO: 19 : 



(i) SEQUENCE 

(A) LENGTH: 687 base p/irs 

(B) TYPE: nucleic acd 

(C) STRANDEDNESS : bo) 

(D) TOPOLOGY: both 



CHARACTERISTICS/; 



£h 



(ii) MOLECULE TYPE : DNA /genomic) 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 19 
TCGTACCNGG GNTCNCNANA /cGACTGCAG CATGCAAGCT 

AAAGTGTAAA GCCTGGGg/g CCTAATGAGT GAGCTAACTC 
ACTGCCCGCT TTCCAGT^GG GAAACCTGTC GTGCCAGCTG 
CGCGGGGAGA GGCGg/tTGC GTATTGGGCG CTCTTCCGCT 
GCGCTCGGTC GTT^GCTGC GGCGAGCGGT ATCAGCTCAC 
ATCCACAGAA TC^GGGGATA ACGCAGGAAA GAACATGTGA 
CAGGAACCGT T/(aAAAGGCC GCGTTGCTGG GCGTTTTTCC 
CGAGCATCAC /aAAAATTCGA CGCTTCAAGT TCAGAGGTGG 
TAAAGATTAdf CAGGGCGTTT TCCCCCTGGG AAGCTNCCTT 
aacctggcc/g GTTTAACCGG GATACCNGNT CGGCCTTTTN 
GGGGNTT^TN GNAAAANGCT AAGGGTT 
(2) INFORMATION FOR SEQ ID NO: 20: 



GGCTAATCAT 
AACATACGAG 
ACATTAATTG 
CATTAATGAA 
TCCTCGCTCA 
TCAAAGGCGG 
GCAAAAGGCC 
ATAGGCTCCG 
GCGAAACCCG 
CGTGCGCTCT 
TCCCCTTNGG 



GGTCATAGCT 60 

CCGGAAGCAT 120 

CGTTGCGCTC 180 

TCGGCCAACG 240 

CTGACTCGCT 300 

TAATACGGTT 360 

AGCAAAAGGC 4 2 0 

CCCCCCTTGA 4 80 

ACAGGGACTA 540 

CCTGTTCCCG 600 

GGGAANCCTT 660 
687 



t 



-77- 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 701 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ lfc> NO: 20: 
GCTCGTACCG GGGATCTNNN ANATCGACTG C&GCATGCAA 
TAGCTGTTTC CTGTGTGAAA TTGTTATCCg/cTCACAATTC 
AGCATAAAGT GTAAAGCCTG GGGTGCCT#5\ TGAGTGAGCT 
CGCTCACTGC CCGCTTTCCA GTCGGGA^VC CTGTCGTGCC 
CAACGCGCGG GGAGAGGCGG TTTGCfSTATT GGGCGCTCTT 
TCGCTGCGCT CGGTCGTTCG GCTGjCGGCGA GCGGTATCAG 



CGGTTATCCA CAGAATCAGG GGATAACGCA GGAAAGAACA 

AAGGCCAGGA ACCGTAAAAA GGCCGCGTTG CTGGGCGTTT 

CTCANGAGCA TCANAAAAaV CGANGCTCAN _GTCANAGGTG 

TAAAAGATNC CCAGGCGt/t TCCCCCCTGG GAAGCTCCCT 

GGNNCCCTGN CCGGNTT^CC GGGGATAANC TTGTTCCGGN 

ANGGTGGGGG GTTTTWTNNA AAAGGCTCAA AGGCTGGTAN 

(2) INFORMATIOn/fOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) /LENGTH: 717 base pairs 
(B^TYPE: nucleic acid 
(CyJ STRANDEDNESS : both 
(©) TOPOLOGY: both 

(ii) MOLECULE TYPE: DNA (genomic) 



GCTTGGCGTA 
CACACAACAT 
AACTCACATT 
AGCTGCATTA 
CCGCTTCCTC 
CTCACTCAAA 
TGTGAGCAAA 
TTTCCATAGG 
-GCGAAACCCG 
CGTGGGGCTC 
CTTTNTCCCC 
G 



ATCATGGTCA 

ACGAGCCGGA 

AATTGCGTTG 

ATGAATCGGC 

GCTCACTGAC 

GGCGGTAATA 

AGGCCAGCAA 

CTCCGCCCCC 

AGAGGNCTAT- 

TCCTGGTTNC 

TTCNGGGAAA 



60 
120 
180 
240 
300 
360 
420 
480 
-54-0" 
600 
660 
701 



(xii SEQUENCE DESCRIPTION : SEQ ID NO: 21: 
GNTNTAGNNN GGNCTAANNG GCGGGGAAAT CGAGCTCGGT ACCCGGGGAT CCTCTAGAGT 



60 



• 
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CGACCTGCAG 

GTTATCCGCT 

GTGCCTAATG 

CGGGAAACCT 

TGCGTATTGG 

TGCGGCGAGC 

GATAACGCAG 

AAGGCCGCGT 

AATCGACGCT 

GCCGTTTCCC 

TTACCNGGAT 



GCATGCAAGC 

CACAATTCCA 

AGTGAGCTAA 

GTCGTGCCAG 

GCGCTCTTCC 

GGTATCAGCT 

GGAAAGAACA 

TGCCTGGCGT 

CAAAGTCAAG 

CCTGGAAGCT 

NCCTGTCCGC 



(2) INFORMATION FOR SEQ ID NO: 22: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 713 bade pairs 

(B) TYPE: nucleicAcid 

(C) STRANDEDNESS/ both 

(D) TOPOLOGY: bo4h 

(ii) MOLECULE TYPE: JDNA (genomic) 




120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

717 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
NNNNCNNNNG GCTGANAfiCG ATAAATCGAG CTCGGTACCC GGGGATCCTC 
CTGCAGGCAT GCAAG(/tTGG CGTAATCATG GTCATAGCTG TTTCCTGTGT 
TCCGCTCACA ATTC&ACACA ACATACGAGC CGGAAGCATA AAGTGTAAAG 
CTAATGAGTG AOfcTAACTCA CATTAATTGC GTTGCGCTCA CTGCCCGCTT 
AAACCTGTCG /gcCAGCTGC ATTAATGAAT CGGCCAACGC GCGGGGAGAG 
TATTGGGCGGT TCTTCCGCTT CCTCGCTCAC TGACTCGCTG CGCTCGGTCG 
GCGAGCGGfrA TCAGCTCACT CAAAGGCGGT AATACGGTTA TCCACAGAAT 
CGCAGG^AAG AACATGTTGA GCAAAAGGCC AGCAAAAGGC CAGGAACCGT 



TAGAGTCGAC 
GAAATTGTTA 
CCTGGGGTGC 
TCCAGTCGGG 
GCGGTTTGCG 
TTCGGCTGCG 
CAGGGGATAA 
AAAAAGGCCG 



60 
120 
180 
240 
300 
360 
420 
480 




CGTTTGCTGG CGTTTTTCCC ATAGGCTCCG CCCCCCTTGA C^AACCATCA CAAAAATCGA 
CGCTCAATTC AGAAGTTGGC GAAAACCCGA CAGGACTAA/ AAAGATACCC AGCGfTTTCCC 
CCCCTGGAAA CTCCCCTCCG TTGCGCCTCT CCCTGTT^CC GAACCTTGCC CGCTTACCGG 
GAATACCTTG TCCNCCTTTT CTCCCCTTCC GGGAa/cGTT NGCGCCTTTC CCC 



540 
600 
660 
713 



